Methods for Recovery of Missing Speech Packets
Authors
Abstract
In packetized voice communication, speech packets are sometimes lost because of transmission problems such as signal fading, interfering users, and noise. Various methods have been proposed for recovering missing speech packets. This thesis analyzes several recovery methods, together with four variants of a waveform substitution method that are used in the objective analysis. The waveform substitution method is based on estimates of slowly varying speech parameters, namely the short-time energy (STE) and the zero-crossing (ZC) measure. The technique is implemented in two different ways based on these parameters, which are stored in the previous packet. If a speech packet is lost, it is recovered from the information stored in the previous packets. The two implementations differ only in their use of the zero-crossing information; the short-time energy estimation is the same in both. A slight modification is made to these two implementations in which the estimated speech parameters are stored in both the previous and the future packets so that two consecutive lost packets can be recovered. This modification applies only if the speech signal is already stored at the transmitter, because it requires future packets to carry information about previous packets, i.e., a non-causal solution. However, a causal solution is obtained if the signal is allowed to be delayed by one packet. The quality of the reconstructed speech signal is analyzed and compared across the four implementations. The implementations have been validated by subjectively observing the recovered speech packets and by considering the improvement in the objective measures: mean opinion score (MOS), mean square error (MSE), and signal-to-noise ratio (SNR). The recovery of samples within a packet is also discussed; it is performed with the Fast Fourier Transform (FFT) block code method, which is implemented by an iterative algorithm.
This method is validated by subjective observation and by improvements in the objective measures mean square error (MSE) and signal-to-noise ratio (SNR). Voice activity detection (VAD) is also used in the waveform substitution method and in the introduction of channel noise. Based on the subjective observations and objective measures, it is concluded that modified method A provides better performance for the recovery of speech packets and that the FFT block code method is validated for recovering samples within a packet.
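As a rough illustration of the quantities named in the abstract, the sketch below computes the short-time energy and zero-crossing count of one packet, and the MSE and SNR between an original and a recovered packet. The function names and the exact definitions (unnormalized energy, a simple sign-change count, SNR in decibels) are assumptions made for illustration; the thesis may define them differently.

```python
import math


def short_time_energy(packet):
    """Sum of squared samples over one packet (unnormalized; an assumption)."""
    return sum(x * x for x in packet)


def zero_crossings(packet):
    """Count sign changes between consecutive samples; zero counts as non-negative."""
    signs = [x >= 0 for x in packet]
    return sum(1 for a, b in zip(signs, signs[1:]) if a != b)


def mse(original, recovered):
    """Mean square error between two equal-length sample sequences."""
    return sum((o - r) ** 2 for o, r in zip(original, recovered)) / len(original)


def snr_db(original, recovered):
    """Signal-to-noise ratio in dB, treating the reconstruction error as noise."""
    signal = sum(o * o for o in original)
    noise = sum((o - r) ** 2 for o, r in zip(original, recovered))
    if noise == 0:
        return float("inf")   # perfect reconstruction
    if signal == 0:
        return float("-inf")  # silent reference with a non-zero error
    return 10 * math.log10(signal / noise)
```

A receiver-side comparison would call `mse(original, recovered)` and `snr_db(original, recovered)` on each reconstructed packet; lower MSE and higher SNR indicate a closer reconstruction.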
Similar resources
Model-based multirate representation of speech signals and its application to recovery of missing speech packets
When the samples of a critically sampled speech signal are lost, objectionable aliasing occurs and perfect recovery of the original speech becomes impossible. In this work, a multirate state-space representation of the autoregressive (AR) speech process is derived to describe the generation of regularly missing-sample speech sequences. Next, a new sample-interpolation algorithm based on the mult...
A New Method for Robust Speech Recognition Based on Missing Data Using a Bidirectional Neural Network
The performance of speech recognition systems is greatly reduced when speech is corrupted by noise. One common approach to robust speech recognition is the use of missing-feature methods. In these methods, the components of the time-frequency representation of the signal (spectrogram) that have a low signal-to-noise ratio (SNR) are tagged as missing and deleted, then replaced using the remaining components and statistical ...
A New Algorithm for Voice Activity Detection Based on Wavelet Packets (RESEARCH NOTE)
Speech constitutes much of the communicated information; most other perceived audio signals do not carry nearly as much information. Indeed, many non-speech signals may be classified as ‘noise’ in human communication. The process of separating conversational speech and noise is termed voice activity detection (VAD). This paper describes a new approach to VAD which is based on the Wavelet ...
Packet Recovery in High-Speed Networks Using Coding and Buffer Management
Traditional data reliability techniques such as retransmissions can result in intolerable storage requirements and data delay when they are used in gigabit wide-area networks. This paper presents a novel technique based on forward error correction (FEC) that allows the destination to reconstruct missing data packets by using redundant parity packets that the source adds to each block of data pa...
Packet loss recovery and control for voice transmission over the Internet
"Best effort" packet-switched networks, like the Internet, do not offer a reliable transmission of packets to applications with real-time constraints such as voice. Thus, the loss of packets impairs the application-level utility. For voice this utility impairment is twofold: on one hand, even short bursts of lost packets may decrease significantly the ability of the receiver to conceal the packet ...